Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
TensorRT Optimization
# TensorRT Optimization
Llama 3.1 8B Instruct FP8
FP8 quantized version of Meta Llama 3.1 8B Instruct model, featuring an optimized transformer architecture autoregressive language model with 128K context length support.
Large Language Model
Transformers
L
nvidia
3,700
21
Featured Recommended AI Models
Empowering the Future, Your AI Solution Knowledge Base
English
简体中文
繁體中文
にほんご
© 2025
AIbase